Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks
نویسندگان
چکیده
Abstract Visual tracking of generic objects is one the fundamental but challenging problems in computer vision. Here, we propose a novel fully convolutional Siamese network to solve visual by directly predicting target bounding box an end-to-end manner. We first reformulate task as two subproblems: classification problem for pixel category prediction and regression object status estimation at this pixel. With decomposition, design simple yet effective architecture based framework, termed SiamCAR, which consists subnetworks: subnetwork feature extraction classification-regression direct prediction. Since proposed framework both proposal- anchor-free, SiamCAR can avoid tedious hyper-parameter tuning anchors, considerably simplifying training. To demonstrate that much simpler achieve superior results, conduct extensive experiments comparisons with state-of-the-art trackers on few benchmarks. Without bells whistles, achieves leading performance real-time speed. Furthermore, ablation study validates various backbone networks, benefit from deeper networks. Code available https://github.com/ohhhyeahhh/SiamCAR .
منابع مشابه
Fully-Convolutional Siamese Networks for Object Tracking
The problem of arbitrary object tracking has traditionally been tackled by learning a model of the object’s appearance exclusively online, using as sole training data the video itself. Despite the success of these methods, their online-only approach inherently limits the richness of the model they can learn. Recently, several attempts have been made to exploit the expressive power of deep convo...
متن کاملConvolutional Regression for Visual Tracking
Recently, discriminatively learned correlation filters (DCF) has drawn much attention in visual object tracking community. The success of DCF is potentially attributed to the fact that a large amount of samples are utilized to train the ridge regression model and predict the location of object. To solve the regression problem in an efficient way, these samples are all generated by circularly sh...
متن کاملDetection and Tracking of Liquids with Fully Convolutional Networks
Recent advances in AI and robotics have claimed many incredible results with deep learning, yet no work to date has applied deep learning to the problem of liquid perception and reasoning. In this paper, we apply fully-convolutional deep neural networks to the tasks of detecting and tracking liquids. We evaluate three models: a single-frame network, multi-frame network, and a LSTM recurrent net...
متن کاملMicroscopy Cell Counting with Fully Convolutional Regression Networks
This paper concerns automated cell counting in microscopy images. The approach we take is to use Convolutional Neural Networks (CNNs) to regress a cell spatial density across the image. This is applicable to situations where traditional single-cell segmentation based methods do not work well due to cell clumping or overlap. We make the following contributions: (i) we develop and compare archite...
متن کاملSiamese Convolutional Networks for Cognate Identification
In this paper, we present phoneme level Siamese convolutional networks for the task of pair-wise cognate identification. We represent a word as a two-dimensional matrix and employ a siamese convolutional network for learning deep representations. We present siamese architectures that jointly learn phoneme level feature representations and language relatedness from raw words for cognate identifi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2022
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-021-01559-4